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1 Complete removal of redundant expressions 

Rastislav Bodik, Rajiv Gupta, Mary Lou Soffa 

May 1998 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1998 conference 
on Programming language design and implementation, volume 33 issue 5 

Additional Information: full citation , abstract , references , citings , index 

terms 

Partial redundancy elimination (PRE), the most important component of global optimizers, 
generalizes the removal of common subexpressions and loop-invariant computations. 
Because existing PRE implementations are based on code motion, they fail to completely 
remove the redundancies. In fact, we observed that 73% of loop-invariant statements 
cannot be eliminated from loops by code motion alone. In dynamic terms, traditional PRE 
eliminates only half of redundancies that are strictly partial. ... 

Keywords: control flow restructuring, demand-driven frequency data-flow analysis, partial 
redundancy elimination, profile-guided optimization, speculative execution 




2 Global optimizations for parallelism and locality on scalable parallel machi nes 

Jennifer M. Anderson, Monica S. Lam 

June 1993 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1993 conference 

on Programming language design and implementation, volume 28 issue 6 
Full text available: ^ pdf(1.74 MB) Additional Information: full citation , references , citings, index terms 



3 An affine partitioning algorithm to maximize parallelism and minimize communi cation 

Amy W. Lim, Gerald I. Cheong, Monica S. Lam 

May 1999 Proceedings of the 13th international conference on Supercomputing 

Full text available: * g pdf(1.33 MB) Additional Information: full citation , re ferences , citings, index terms 



4 Supporting the restructuring of da ta abstractions through manipulation of a program 
visualization 

Robert W. Bowdidge, William G. Griswold 

April 1998 ACM Transactions on Software Engineering and Methodology (TOSEM), 

Volume 7 Issue 2 
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Full text available: ^ pdf(1.57 MB) Additional Information: full citation , abstract , references , citing s, index 

terms 

With a meaning-preserving restructuring tool, a software engineer can change a program's 
structure to ease future modifications. However, deciding how to restructure the program 
requires a global understanding of the program's structure, which cannot be derived easily 
by directly inspecting the source code. We describe a manipulate program visualization— 
the star diagram— that supports the restructuring task of encapsulating a global data 
structure. The star diag ... 

Keywords: meaning-preserving restructuring, semi-automated restructuring, software 
visualization, star diagram, tool-supported restructuring 



Compiler-directed selection of dynamic memo r y layouts 

Mahmut Kandemir, Ismail Kadayif 

April 2001 Proceedings of the ninth international symposium on Hardware/software 
codesign 

Full text available: pdf(650.29 KB) Additional Information: full citation , abstract , references , index terms 

Compiler technology is becoming a key component in the design of embedded systems, 
mostly due to increasing participation of software in the design process. Meeting system- 
level objectives usually requires flexible and retargetable compiler optimizations that can be 
ported across a wide variety of architectures. In particular, source-level compiler 
optimizations aiming at increasing locality of data accesses are expected to improve the 
quality of the generated code. Previous compiler-based ap ... 

Keywords: array reuse, data dependence, data locality, memory layout optimization, 
software compilation 




6 Interprocedural transformations for parallel code generation 

Mary W. Hall, Ken Kennedy, Kathryn S. McKinley 

August 1991 Proceedings of the 1991 ACM/IEEE conference on Supercomputing 

Full text available: pdf(1.26 MB) Additional Information: full citation , references , citings , index terms 




Load-reuse analysis: design and evaluation 

Rastislav Bodik, Rajiv Gupta, Mary Lou Soffa 

May 1999 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1999 conference 
on Programming language design and implementation, volume 34 issue 5 

r „, . , 0 Q ~ ..p., Additional Information: full citation , abstract , references , citings , index 

Full text available: Tq pd f(186 MB} ' *~ 

^ terms 

Load-reuse analysis finds instructions that repeatedly access the same memory location. 
This location can be promoted to a register, eliminating redundant loads by reusing the 
results of prior memory accesses. This paper develops a load-reuse analysis and designs a 
method for evaluating its precision. In designing the analysis, we aspire for completeness— 
the goal of exposing all reuse that can be harvested by a subsequent program 
transformation. For register promotion, a suitable transfo ... 

Keywords: data-flow analysis, profile-guided optimizations, program representations, 
register promotion 
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R. W. Schwanke, M. A. Platoff 

October 1989 ACM SIGSOFT Software Engineering Notes , Proceedings of the 2nd 

International Workshop on Software configuration management, volume 14 

Issue 7 

Full text available: f Ppdfd.14 MB) Additional Information: full citation , references , citings , index terms 



Reducing branch costs via branch alignment 

Brad Calder, Dirk Grunwald 

November 1994 Proceedings of the sixth international conference on Architectural 

support for programming languages and operating systems, volume 29 , 

28 Issue 11 , 5 

p- ^ -. a _.r/^A»/.r». Additional Information: full citation , abstract, references , citings, index 
Full text available: *gjpdf(1.20 MB) terms 

Several researchers have proposed algorithms for basic block reordering. We call these 
branch alignment algorithms. The primary emphasis of these algorithms has been on 
improving instruction cache locality, and the few studies concerned with branch prediction 
reported small or minimal improvements. As wide-issue architectures become increasingly 
popular the importance of reducing branch costs will increase, and branch alignment is one 
mechanism which can effectively reduce ... 

Keywords: branch prediction, branch target buffers, profile-based optimization, trace 
scheduling 




10 A software engineering perspective on algorithm ics 

Karsten Weihe 

March 2001 ACM Computing Surveys (CSUR), volume 33 issue i 

r „ A ^ , ^ ^ ud. Additional Information: full citation , abstract , references , index terms , 
Full text available: fgl pdf(162 MB) 

^ review 

An algorithm component is an implementation of an algorithm which is not intended to be a 
stand-alone module, but to perform a specific task within a large software package or even 
within several distinct software packages. Therefore, the design of algorithm components 
must also incorporate software-engineering aspects. A key design goal is adaptability. This 
goal is important for maintenance throughout a project, prototypical development, and 
reuse in new, unforseen contex ... 

Keywords: algorithm engineering 




11 Reverse engineering: Search based reverse engineering 

Brian S. Mitchell, Spiros Mancoridis, Martin Traverso 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering 

Full text available: ^ pdf(208.64 KB) Additional Information: full citation , abstract , references 

In this paper we describe a two step process for reverse engineering the software 
architecture of a system directly from its source code. The first step involves clustering the 
modules from the source code into abstract structures called subsystems. The second step 
involves reverse engineering the subsystem-level relations using a formal (and visual) 
architectural constraint language. We use search techniques to accomplish both of these 
steps, and have implemented a suite of integrated tools to ... 



Combinational lo gic synthesis for LUT based field programmable gate arrays 
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Jason Cong, Yuzheng Ding 

April 1996 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 1 Issue 2 

^ ~* „™ Additional Information: full citation , abstract , references , citings, index 

Full text available: Q pdf(628.91 KB) teTml^eview 

The increasing popularity of the field programmable gate-array (FPGA) technology has 
generated a great deal of interest in the algorithmic study and tool development for FPGA- 
specific design automation problems. The most widely used FPGAs are LUT based FPGAs, in 
which the basic logic element is a K-input one-output lookup-table (LUT) that can implement 
any Boolean function of up to K variables. This unique feature of the LUT has brought new 
challenges to lo ... 

Keywords: FPGA, area minimization, computer-aided design of VLSI, decomposition, delay 
minimization, delay modeling, logic optimization, power minimization, programmable logic, 
routing, simplification, synthesis, system design, technology mapping 



13 SilkRoute: A framework for publishing relational data in XML 

Mary Fernandez, Yana Kadiyska, Dan Suciu, Atsuyuki Morishima, Wang-Chiew Tan 
December 2002 ACM Transactions on Database Systems (TODS), volume 27 issue 4 

Additional Information: full citation , abstract , references , citings , index 




Full text available: ■ g) pdf(687.91 KB) terms 

XML is the "lingua franca" for data exchange between interenterprise applications. In this 
work, we describe SilkRoute, a framework for publishing relational data in XML. In 
SilkRoute, relational data is published in three steps: the relational tables are presented to 
the database administrator in a canonical XML view; the database administrator defines in 
the XQuery query language a public, virtual XML view over the canonical XML view; and an 
application formulates an XQuery query over the publ ... 

Keywords: XML, XML storage systems, XQuery 




14 All things UML: Nice class diagrams admit good design? 

Holger Eichelberger 

June 2003 Proceedings of the 2003 ACM symposium on Software visualization 

Full text available: ' gj pdf(413.61 KB) Additional Information: full citation , abstract , references , index terms 

Analysis and design of programs by using tools has emerged to a standard technique in 
object-oriented software engineering. Many of these tools claim to implement methods 
according to the UML standard and some of the tools provide automatic layout of the 
diagrams drawn by the user or generated automatically from source code. In this paper we 
propose a set of aesthetic criteria for UML class diagrams and discuss the relation between 
these criteria, HCI and design aspects of object-oriented softwa ... 

Keywords: HCI, UML class diagrams, aesthetics, metrics, software engineering 



15 Cache-conscious data placement 

Brad Calder, Chandra Krintz, Simmi John, Todd Austin 

October 1998 Proceedings of the eighth international conference on Architectural 

support for programming languages and operating systems, volume 33 , 32 

Issue 11 , 5 

^ ~ Additional Information: full citation , abstract , references , citings, index 

Full text available: || pdf(1.49 MB) terms 

As the gap between memory and processor speeds continues to widen, cache eficiency is an 
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increasingly important component of processor performance. Compiler techniques have been 
used to improve instruction cache pet$ormance by mapping code with temporal locality to 
different cache blocks in the virtual address space eliminating cache conflicts. These code 
placement techniques can be applied directly to the problem of placing data for improved 
data cache pedormance.In this paper we present a gene ... 

16 Technical reports 

SIGACT News Staff 

January 1 980 ACM SIGACT News, volume 12 issue 1 
Full text available: pd f(5.28 MB) Additional Information: full citation 




17 A dynamic locality optimization algorithm for linear algebra codes 

Mahmut Kandemir 

March 2001 Proceedings of the 2001 ACM symposium on Applied computing 

Full text available: 'PI pdff 107.74 KB) Additional Information: full citation , references , citings , index terms 



Keywords: cache locality, data reuse, dynamic optimizations, memory layouts, optimizing 
compilers 



18 Efficient scheduling of conditional behaviors for high-level synthesis 

Apostolos A. Kountouris, Christophe Wolinski 

July 2002 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 7 Issue 3 
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Full text available: Tn pdf (1.50 MB) 




terms 

As hardware designs get increasingly complex and time-to-market constraints get tighter 
there is strong motivation for high-level synthesis (HLS). HLS must efficiently handle both 
dataflow-dominated and controlflow-dominated designs as well as designs of a mixed 
nature. In the past efficient tools for the former type have been developed but so far HLS of 
conditional behaviors lags behind. To bridge this gap an efficient scheduling heuristic for 
conditional behaviors is presented. Our heuristic a ... 

Keywords: Design automation, conditional behavior, high level synthesis (HLS), scheduling 



19 A compiler technique for improving whole-program locality 

Mahmut Taylan Kandemir 

January 2001 ACM SIGPLAN Notices , Proceedings of the 28th ACM SIGPLAN-SIGACT 

symposium on Principles of programming languages, volume 36 issue 3 
Full text available: " gpdf(1.37 MB) Additional Information: full citation , abstract , references , index terms 

Exploiting spatial and temporal locality is essential for obtaining high performance on 
modern computers. Writing programs that exhibit high locality of reference is difficult and 
error-prone. Compiler researchers have developed loop transformations that allow the 
conversion of programs to exploit locality. Recently, transformations that change the 
memory layouts of multi-dimensional arrays— called data transformations — have been 
proposed. Unfortunately, both data and loop transformations have ... 

Keywords: cache locality, data reuse, memory layouts, optimizing compilers, static 
optimizations 
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20 NanoFabrics: spatial computing using molecular electronics 

Seth Copen Goldstein, Mihai Budiu 

May 2001 ACM SIGARCH Computer Architecture News , Proceedings of the 28th 

annual international symposium on Computer architecture, volume 29 issue 2 

Full text available: ff?) pdf(996.26 KB) Additional Information: full citation , abstract , references , citings, index 
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The continuation of the remarkable exponential increases in processing power over the 
recent past faces imminent challenges due in part to the physics of deep-submicron CMOS 
devices and the costs of both chip masks and future fabrication plants. A promising solution 
to these problems is offered by an alternative to CMOS-based computing, chemically 
assembled electronic nanotechnology (CAEN). 

In this paper we outline how CAEN-based computing can become a reality. We briefly 
describe rec ... 
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riotieor.berkeley.edu/ riot/Applications/WeightedMinCut/ - 8k - Cached - Similar pages 
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... The bottleneck graph partition problem is to partition the nodes of a graph into 
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An Interior-Point Method for Semidefinite Programming - Helmberg, RendL (1996) (Correct) (131 citations) 
Problem The max-cut problem is the problem of partitioning the node set of an edge-weighted undirected 
problem of partitioning the node set of an edge-weighted undirected graph into two parts so as to 
We show that the approach is very efficient for graph bisection problems, such as max-cut. Other 
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the following k- clustering problem: given a weighted graph G =X d) on N vertices, where d(Delta 
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Minimum Cuts in Near-Linear Time - Karger (1998) (Correct) (22 citations) 

and m (possibly weighted) edges, we wish to partition the vertices into two non-empty sets so as to 
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